MMsINC®: A New Public Large-Scale Chemoinformatics Database System
نویسندگان
چکیده
MMSinc is a database of commercially available compounds. It currently contains over 4 million /non-redundant/ chemical compounds in 3D format. The whole database was studied in term of uniqueness, diversity, frameworks, chemical reactivity, drug-like and lead-like properties. There are more than 175.000 frameworks in our database. There are 3.89 millions (98%) of drug-like molecules among which more than 3.61 millions (91%) are lead-like. Moreover, 3.45 million (87%) are considered chemically stable compounds. The druglikeness and leadlikeness are estimated using Lipinski and Oprea cutoff values. The compounds are stored in a PostgreSQL database and the code to manage this database is in Java. Moreover, MMsINC is nicely integrated with PubChem and PDB databases facilitating the cross exchange of ligand information. We are developing tools for efficient database access and analysis, for virtual screening and chemoinformatic applications. MMsINC is accessible at the following web address:
منابع مشابه
MMsINC: a large-scale chemoinformatics database
MMsINC (http://mms.dsfarm.unipd.it/MMsINC/search) is a database of non-redundant, richly annotated and biomedically relevant chemical structures. A primary goal of MMsINC is to guarantee the highest quality and the uniqueness of each entry. MMsINC then adds value to these entries by including the analysis of crucial chemical properties, such as ionization and tautomerization processes, and the ...
متن کاملAssessment of "drug-likeness" of a small library of natural products using chemoinformatics
Even though natural products has an excellent record as a source for new drugs, the advent of ultrahigh-throughput screening and large-scale combinatorial synthetic methods, has caused a decline in the use of natural products research in the pharmaceutical industry. This is due to the efficiency in generating and screening a high number of synthetic combinatorial compounds; whereas traditional ...
متن کاملChemDB: a public database of small molecules and related chemoinformatics resources
MOTIVATION The development of chemoinformatics has been hampered by the lack of large, publicly available, comprehensive repositories of molecules, in particular of small molecules. Small molecules play a fundamental role in organic chemistry and biology. They can be used as combinatorial building blocks for chemical synthesis, as molecular probes in chemical genomics and systems biology, and f...
متن کاملA New Approach for Knowledge Based Systems Reduction using Rough Sets Theory (RESEARCH NOTE)
Problem of knowledge analysis for decision support system is the most difficult task of information systems. This paper presents a new approach based on notions of mathematical theory of Rough Sets to solve this problem. Using these concepts a systematic approach has been developed to reduce the size of decision database and extract reduced rules set from vague and uncertain data. The method ha...
متن کاملApplication of Information - Theoretic Concepts in Chemoinformatics
The use of computational methodologies for chemical database mining and molecular similarity searching or structure-activity relationship analysis has become an integral part of modern chemical and pharmaceutical research. These types of computational studies fall into the chemoinformatics spectrum and usually have large-scale character. Concepts from information theory such as Shannon entropy ...
متن کامل